05:00
2026-05-26
alex.smola.org
large-language-models
You don't need all the LLM benchmarks
A new analysis of over 5,400 AI models reveals that benchmark scores for large language models are highly correlated, with just five subjects on the MMLU test predicting the remaining 52 with 91% accu…